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ABSTRACT 


A  method  of  estimating  the  magnitude-squared  coherence 
function  (spectrum)  for  zero-mean  processes  that  are  wide- 
sense  stationary  and  random  is  presented.  The  estimation 
technique  utilizes  the  weighted  overlapped  segmentation  fast 
Fourier  transform  (FFT)  approach.  Analytical  and  empirical 
results  for  statistics  of  the  estimator  are  presented  for  the 
processes.  Analytical  expressions  are  derived  in  the  non- 
overlapped  case.  Empirical  results  show  a  decrease  in  bias 
and  variance  of  the  estimator  with  increasing  overlap  and 
suggest  that  a  50-percent  overlap  is  highly  desirable  when 
cosine  (Hanning)  weighting  is  used. 
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ESTIMATION  OF  THE 

MAGNITUDE-SQUARED  COHERENCE  FUNCTION  (SPECTRUM) 
I.  INTRODUCTION 


The  complete  probability  structure  of  the  zero-mean  processes  x(t)  and 
y(t)  ,  which  are  w  (de-sense  stationary  and  Jointly  Gaussian,  is  specified  by  the 
spectral  density  matrix, 


Ml  (f)  = 

xy' 


V 


(1.1) 


where 

(J>  (f)  is  the  (real)  auto  power  spectral  density  function  of  x(t), 

/LA 

,X>yy<J)  is  the  (veal)  auto  power  spectral  density  function  of  y(t)  ,  and 
<|>  (f)  is  the  (complex)  cross  power  spectral  density  function  of  x(t) 

and  y(t)  and  consists  of  a  real  or  coincidental  (CO)  spectrum  and  an  imaginary 
or  quadrature  (quad)  spectrum. 1 

A  simplifying  ratio  is  the  complex  coherence  function  (spectrum), 


.<0  = 


*  (0 

*xyx  ' 


xy  V*  m*  m 

’  ^xx  '  yy  ' 


(1.2) 


or,  more  commonly,  the  magnitude- squared  coherence  function, 
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t 
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i 


**+***»? 
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*  (f)*>  (f) 
^xx'  '  ^yy' ' 


(1-3) 


The  term  "coherence"  can  imply  Eqs.  (1.2),  (1.3),  or  the  positive  square  root 
of  Eq.  (1.3). 

Equation  ',1. 3)  possesses  a  number  of  useful  attributes:  First,  it  always 

falls  between  zero  and  one.  Second,  it  is  zero  if  the  processes  x(t)  and  y(t) 

are  uncorrelated.  Third,  it  is  equal  to  unity  if  and  only  if  there  exists  a  linear 

2 

relation  between  x(t)  and  y(t). 

These  attributes  are  of  particular  significance  in  sonar  systems  where  a 
waveform  received  at  two  spatially  separated  elements  of  a  hydrophone  array 
*  may  be  corrupted  by  additive  noise  uncorrelated  from  the  first  to  the  second 
element. 

Unfortunately,  the  difficulty  in  estimating  the  true  coherence  has  plagued 

3  1 

modern  statisticians.  An  analytical  expression  was  derived  by  Goodman  for 

the  probability  density  function  of  the  estimate  of  magnitude  coherence  j  'y  | 

when  several  independent  observations  (or  segments)  of  the  processes  are 

available.  A  closed-form  solution  for  the  cumulative  distribution  function,  as 

a  finite  sum  of  hypergeometric  functions,  can  be  found  by  proper  identification 

4  ‘ 

of  variables  in  the  work  of  Fisher.  The  application  of  Fisher's  work  to  this 

problem  is  believed  original  in  this  thesis.  Earlier,  statistics  for  coherence 

5-7 

estimation  were  found  in  tables,  and  graphs,  and  transformations  to  be 

performed  on  the  coherence  estimator  were  suggested  so  as  to  "normalize" 

B  9 

(make  Gaussian)  the  density  function.  ’ 


Certain  empirical  studies  have  also  been  c  cm  ducted.  Haubrich  suggested 

that  the  total  time  series  under  investigation  be  segmented  into  a  number  of 

shorter  segments  overlapping  one  another  by  50  percent  and  that  a  triangular 

7 

weighting  function  be  applied  to  each  segment.  Tick  showed  empirical  exam¬ 
ples  of  the  types  of  estimates  to  anticipate  when  the  true  coherence  is  0.2  and 

g 

mean  lagged  product  techniques  are  used.  Benignus  empirically  showed  the 
bias  and  confidence  intervals  to  expect  when  n  independent  segments  are  proc¬ 
essed  using  a  rectangular  weighting  function. 

Current  techniques  for  coherence  estimation  involve  applying  the  fast 
Fourier  transform  (FFT). 11  Some  of  the  latest  published  results  on  coherence 
estimation  are  limited  in  scope  to  processes  that  have  relatively  flat  .'spectra. 10 
The  problems  associated  with  nonflat  spectra  can  be  avoided  through  judicious 
choice  of  a  time-weighting  (or  windowing)  function. 12-14  The  use  of  a  weighting 
function  is  necessary  for  data  not  spectrally  flat  and  should  be  prudently  selected 
for  unknown  data.  In  coherence  estimation,  the  application  of  a  weighting  func¬ 
tion  results  in  wasted  data  (loss  of  stability  and  increase  of  bias)  unless  over- 
14 

lapped  processing  is  employed.  In  underwater  acoustic  environments,  which 
require  weighting  functions  and  good  spectral  resolution,  but  which  remain 
stationary  only  for  limited  amounts  of  time,  such  wastage  can  not  be  permitted. 

This  thesis  empirically  determines  the  effect  of  overlap  processing  on  the 
estimated  magnitude-squared  coherence  function  when  cosine  (or  Hanning,  after 
Julius  von  Hann)  weighting  has  been  applied. 

The  empirical  method  for  determining  the  effect  of  overlap  has  been  limited 
in  scope  to  a  cosine  weighting  function,  a  finite  time  history,  and  a  desired 


frequency  resolution  (half-power)  bandwidth.  Under  these  conditions,  estimates 
of  bias  and  variance  of  the  estimate  of  magnitude- squared  coherence  have  been 
made  for  two  values  of  coherence.  The  behavior  of  these  statistics  as  a  function 
of  increasing  overlap  is  presented  and  is  believed  original. 


II.  COHERENCE  FUNCTION  (SPECTRUM)  AND  ITS  USES 

This  chapter  defines  the  coherence  function  (spectrum).  Additionally,  it 
reviews  those  terms  necessary  for  its  definition  or  helpful  in  its  estimation. 
Finally,  this  chapter  presents  some  examples  of  the  uses  of  coherence  to  lay 
a  background  for  why  this  particular  function  is  meaningful. 


II.  A.  COHERENCE  FUNCTION 

The  essense  of  the  coherence  function  is  a  collapsed  power  spectral  density 
matrix.  To  fully  appreciate  the  intricacies  of  its  definition,  it  is  first  necessary 
to  review  some  basic  concepts.  They  include  the  correlation  matrix,  wide- 
sense  stationarity,  ergodicity,  Gaussian  assumption,  and  power  spectral 
density  matrix. 

II.  A.  1.  Correlation  Matrix  and 
Wide-Sense  Stationarity 

The  general  correlation  function  between  zero-mean  processes  x(t)  and 

15 

y(t),  which  are  real  and  nonstationary,  is  defined  by  Davenport  and  Root,  as 
follows: 

Rxy(tl'  t2)  “  E  [X(tl)  y(t2)]  *  (2* X) 


5 


■fcSSXHeeOTKWcne. 


which  depends  on  the  absolute  time  instants  t^  and  t^  .  If  the  cross-correla¬ 
tion  function  depends  only  on  the  time  difference  r  -  t  -  t  and  does  not 

2  X 

depend  on  the  time  origin,  that  is,  if 


Rxy<l*  1  +  r  *  =  Rxy* T  *  “  E  [x(t*  y  <*  +  r >]  • 


(2.2) 


then  the  processes  are  called  wide-sense  stationary.  It  is  not  necessary  for 

R  ( r )  to  be  an  even  or  an  odd  function, 
xy 

Similarly,  the  autocorrelation  function  in  the  wide-sense  stationary  case 

( 

becomes  ! 


R  <r) 
xx 


E  ^x(t)  x  (t  +  r)j 


(2.3) 


which  is  an  even  function.  The  autocorrelation  function  of  the  process  y(t)  is 
similarly  defined. 

The  correlation  matrix  for  the  wide-sense  stationary  processes  x(t)  and 
y(t)  may  now  be  defined  by 


R 

xx 


( 


r) 


R  (r) 
yx' 


(2.4) 


When  two  zero-mean  random  processes  have  a  correlation  matrix  that 

depends  only  on  the  time  difference,  it  is  meaningful  to  talk  about  the  Fcorier 

16 

transformation  of  the  correlation  matrix. 


1 


l 


i 


! 
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II.  A.  2.  Ergodicity 


Random  processes  can  be  characterized  by  an  infinite  number  of  waveforms. 

Each  of  these  waveforms  is  referred  to  as  a  sample  or  member  function  of  the 
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random  process  and  is  itself  infinite  in  duration.  Statistics  of  an  order  higher 
than  the  correlation  function  can  be  computed  by  averaging  over  the  ensemble  of 
all  sample  functions.  These  statistics  can  also  be  computed  from  any  one  of  the 
sample  functions. 

If  all  the  higher  order  statistics,  when  computed  from  any  one  of  the  sample 
functions,  are  the  same  as  the  ensemble  average  over  all  the  sample  functions, 
then  the  processes  are  called  ergodic.  hi  particular,  the  correlation  matrix 
computed  over  any  one  sample  function  is  the  same  as  the  correlation  matrix 
computed  over  an  ensemble  of  sample  functions.  It  should  be  noted  that  it  is 
possible  for  the  correlation  matrix  to  be  the  same  wlen  computed  over  different 
sample  functions  and  yet  for  some  higher  order  statistics  to  differ  when  computed 
over  different  sample  functions. 

It  is  important  for  the  results  presented  in  this  thesis  that  the  correlation 
matrix  be  the  same  when  computed  over  different  sample  functions.  This,  in 
essence,  allows  the  correlation  matrix  (or  its  linear  transformations)  to  be  spec¬ 
ified  with  probability  one  from  one  sample  function.  If  the  correlation  matrix 
does  not  differ  when  computed  over  different  sample  functions,  the  processes 

are  still  called  ergodic,  but  now  some  qualifying  adjective  must  be  applied  to 
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denote  the  strength  of  the  ergodicity.  This  author  chooses  to  use  the  adjective 
"wide-sense"  to  specify  the  strength  of  the  ergodicity.  Processes  that  are  wide- 
sense  ergodic  are  also  wide-sense  stationary. 


II.  A.  3.  Gaussian  Assumption 


If  two  zero-mean  processes  are  jointly  Gaussian  and  wide-sense  stationary, 

16 

then  their  correlation  matrix  dictates  all  higher  order  statistics. 

II.  A.  4.  Power  Spectral  Density  Matrix 

Several  important  concepts  have  preceded  this  section:  First,  in  order  to 
mathematically  determine  the  power  spectral  density  matrix,  the  processes 
must  be  stationary  (in  the  wide  sense).  In  practical  estimation  situations  ergo- 
dicity  is  presumed,  and  only  one  time-limited  sample  function  is  collected  for 
each  process  under  investigation.  It  is  desirable,  but  not  necessary,  that  the 
two  processes  be  jointly  Gaussian.  When  the  two  processes  are  both  stationary 
and  jointly  Gaussian,  then  knowledge  of  their  correlation  matrix  completely 
specifies  the  statistics  of  the  processes. 

Given  the  zero-mean  processes  x(t)  and  y(t)  ,  which  are  real,  stationary, 
and  jointly  Gaussian,  a  complete  characterization  for  the  probability  structure 
of  the  processes  is  specified  in  terms  of  the  power  spectral  density  matrix 

M  (0  * 

xy 
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The  power  spectral  density  functions  composing  the  elements  of  the  power 
spectral  density  matrix  are  the  Fourier  transforms  of  the  associated  correlation 
functions.  The  cross  power  spectral  density  function  is 


^  t^f^^4ii&i&'i{Ml  Stem 


I  V03/  V,,e"i2'f'dr 


In  general,  this  function  is  complex  since  R  (  r  )  is  not  necessarily  odd  or 

xy 

even. .  Similarly, 


<J>  (f)  A  /*  R  (r)  e~i2*frdr  , 
xx  =  J  xx 

-oo 


which  is  purely  real  since  R  (r)  is  even. 

XX 


II.  A.  5.  Definition 


The  complex  coherence  function  for  two  wide-sense  stationary  processes 
is  a  normalized  complex  cross  power  spectral  density  function  given  by 


y  (f)  A 
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<$  (f) 

^ xy' 
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Since  <J>  (f)  is  complex, 
xy 


v>-  Vf,+i  v°  • 


Further,  <j>  (f)  and  <J>  (f)  are  nonnegative,  real  functions  of  f  , 
xx  yy 


<J>  <f)  >  0 
^xx  — 


(2.10) 
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and 
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(2.11) 


The  magnitude  of  the  complex  coherence  function  (or,  simply,  the  magnitude 
coherence)  is 

I VI 

-  <2  12) 
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It  follows  directly  that  the  square  of  the  magnitude  of  the  complex  coherence 
function  (or,  simply,  the  magnitude- squared  coherence)  is 
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(2. 13a) 


(2. 13b) 


Although  the  term  "coherence"  can  imply  Eqs.  (2. 8),  (2. 12),  or  (2. 13),  it 
usually  refers  to  Eq.  (2. 13). 

For  ease  of  notation,  the  dependence  on  f  is  often  not  specified;  for 
example, 
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(2. 14) 


II.  B.  USES  OF  COHERENCE  FUNCTION 


The  magnitude- squared  coherence  function  for  the  zero-mean,  wide-sense 
stationary  processes  x(t)  and  y(t)  is  useful  in  several  ways,  which  will  be 
proved  in  the  following  sections.  First,  for  two  processes  that  are  linearly 
related,  the  magnitude-squared  coherence  function  is  unity.  Second,  for  two 
independent  processes,  the  magnitude- squared  coherence  function  is  zero. 

Third,  under  the  assumptions  to  be  presented,  the  magnitude-squared  coherence 
function  serves  as  a  signal-to-ncise  measure. 

II.  B.  1.  A  Measure  of  System  Linearity 

The  magnitude- squared  coherence  function  can  be  used  to  measure  system 
12 

linearity.  In  Fig.  1  consider  the  linear  system  with  input  x(t)  ,  impulse 
response  h(  r )  t  and  output  y(t)  .  The  output  y(t)  is  expressed  by  the  con¬ 
volution  integral 


Fig.  1.  Linear  System  with  Impulse  Response  h(r) 
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The  frequency-domain  equivalent  is  a  multiplication  obtained  via  the 
Fourier  transformation: 


Y(f)  =  H(f)  X  (f) 


If  x(t)  is  a  sample  function  of  a  stations  •;  random  process,  then 


(2. 16) 


(2. 17) 


<J,yy(f)  =  H(f)  H*(f)<*»xx(f)  =  H(f)cJ>*y<f)  . 


Since  the  magnitude-  "uared  coherence  defined  by  Eq.  (2. 13)  can  be 


(2. 18) 


written  as 
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application  of  Eqs,  (2. 17)  and  (2. 18)  yields 


7  (f)  2  =  H(f)  ~=l,yf 
Xy  H(f) 


(2.20) 


Consequently,  the  magnitude-squared  coherence  between  the  input  and  output  of 
a  linear  system  is  unity. 


13 


II.  B.  2.  A  Measure  of  Correlation 


If  the  zero-mean  processes  x(t)  and  y(t)  are  independent,  they  are  also 
uncorrelated  and  orthogonal;  that  is 


R  <r)  =  E  [x(t)y(t  +  r )]  =  E [x(t)]  E  [y(t  +  r  )]  =  0  ,  (2.21} 

xy 


/  RyJr)e"j2,rfTdr  -  0  , 


<t>  (f)  =  , 
xyv  J_m  xy 


(2.22) 


and 


y  (0 
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=  0  ,  V  f  . 


(2.23) 


Hence,  if  the  two  processes  are  independent  or  uncorrelated  with  zero  mean, 
the  magnitude-squared  coherence  between  them  is  zero. 

II.  B.  3.  A  Measure  of  Signal-to-Noise  Ratio 

Consider  a  signal,  s(t)  ,  passed  through  two  linear  filters  and  received 
at  two  sensors  where  it  is  corrupted  by  uncorrelated  additive  noises.  The 
received  waveform  at  each  sensor  is  then  passed  through  two  linear  filters,  as 
shown  in  Fig.  2. 


Assume  that  s(t)  ,  n  (t)  ,  and  n  (t)  are  uncorrelated;  that  is, 

X  It 


E  ^(t)  ng(t+  t  )j  =  o  , 

(2.24) 

E  £n^(t)  s(t  +  r)  j  =0  , 

(2. 25) 

Fig.  2.  Signal  s(t)  as  Received  at  Two  Sensors 
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(as,  for  example,  if 


H  (f)  =  e~^2*f  r  a 


(2. 33) 


and 


=  e~j2,rfTb 


(2.34) 


corresponding  to  time  delays  and  c?  a  directional  signal),  then 
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Second,  when 


nx(t)  =  0  , 
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and 


III.  COHERENCE-ESTIMATION  PROCEDURE 


The  procedure  for  estimating  the  coherence  or  magnitude-squared  coherence 
functions  for  wide-sense  ergodic  (and,  hence,  wide-sense  stationary),  zero- 
mean  random  processes  x(t)  and  y(t)  is  discussed  in  this  chapter.  (References 
within  this  chapter  to  x(t)  and  y(t)  apply  to  those  specific  processes  with  the 
noted  characteristics,  that  is,  zero-mean,  wide-sense  ergodic. )  The  basic  ob¬ 
jective  is  to  obtain  estimates  of  the  elements  of  the  spectral  density  matrix, 


r*  (f>  $  (f) 

XX  *xy 
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in  order  to  form  the  magnitude- squared  coherence  estimator. 

The  estimation  procedure  described  is  the  direct  method,  which  is  discussed 
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in  part  by  Welch,  Knapp,  Bingham,  ’  Benignus,  Nuttall,  and  Carter 
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and  Arnold.  It  includes  cosine  weighting  and  overlapped  processing  and  is 
used  because  of  the  computational  advantage  of  the  FFT.11 

Briefly,  the  method  implemented  consists  of  obtaining  two  finite-time  series 
from  the  random  processes  being  investigated.  The  time  series  are  segmented 
into  n  segments,  each  having  P-data  points.  For  example,  from  each  process 
there  may  be  32  segments,  each  segment  having  4096  points.  The  segments  may 
be  overlapped  or  disjoint.  Each  segment  is  multiplied  by  a  weighting  function, 
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-  - - . — 


and  the  FFT  of  the  weighted  P-point  sequence  is  performed.  The  Fourier 
coefficients  for  each  weighted  segment  are  then  used  to  estimate  the  elements 
of  the  power  spectral  density  matrix.  The  power  spectral  estimates  thus 
obtained  from  each  set  of  weighted  sequences  are  then  averaged  over  all  the  n 
segments.  Next,  the  resultant  estimates  are  used  to  form  the  magnitude- 
squared  coherence. ^ 


III.  A.  QUANTIZED  SEQUENCE  OBTAINED 
FROM  WEIGHTED  SEGMENT  OF  DATA 


Consider  the  time-limited  sample  functions  of  processes  x(t)  and  y(t) 
(specified  in  Chapter  III. ).  Let  the  sample  functions  be  further  constrained  so 
that  they  have  the  same  bandwidth.  This  may  come  about  as  a  result  of  (1)  the 
physics  of  the  experiment,  (2)  the  bandpass  characteristics  of  some  recording 
device,  or  (3)  the  intentional  introduction  of  bandpass  or  low-pass  filters  to 
prevent  aliasing.  Analog  to  digital  (A/  D)  conversion  of  the  signals  is  now  accom¬ 
plished  by  sampling  the  two  analog  signals  at  a  frequency,  f  Hz,  greater  than 

s 

twice  the  bandwidth  of  the  signals.  This  technique  yields  two  quantized  se¬ 
quences  of  numbers  or  time  series.  The  quantization  error  decreases  as  the 
number  of  bits  in  the  quantizer  increases.  (Errors  as  a  result  of  quantization 
are  beyond  the  scope  of  this  work. ) 

Let  these  two  time  series  from  processes  x(t)  and  y(t)  ,  which  are  drawn 
continuously  for  convenience,  be  depicted  as  in  Fig.  3. 

The  method  of  overlapped  weighted  segmentation  requires  that  before 
estimating  the  coherence  between  x(t)  and  y(t)  ,  both  x(t)  and  y(t)  be 
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Fig.  3.  Two  Time  Series  from  Processes  x(t)  and  y (t) 


multiplied  by  a  series  of  real  weighting  functions,  w  (t)  ,  or  sampled  and 

s 


quantized  versions  thereof,  as  in  Fig.  4. 


",  (0 


/v 


-»- 


^  total 


w2(t) 


w,  (t) 


•  a+T 

t 


-» 


•  ^total 


-»- 


JX  r 

(s-l)a  (s-l)a+f 


total 


wn(0 


■» 


A 


(n-l)a  (n-1)a+T 
^total 


22. 


Fig.  4.  Overlapped  Weighting  Functions  (Modified  from  Knapp  ) 

The  method  implemented  computes  a  P- point  discrete  Fourier  transform 
(DFT)  for  each  of  the  weighted  segments.  The  frequency-domain  equivalent  of 
multiplying  each  segment  by  a  weighting  function  is  a  convolution  of  the  true 
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spectrum  with  the  Fourier  transform  of  the  weighting  function.  Hence,  the 
weighting  function  should  be  judiciously  selected  in  order  that  the  true  spectrum 
be  least  distorted. 


The  factors  affecting  the  selection  of  the  segment  length  and  window  shape  are 

1.  w  (t)  should  be  relatively  easy  to  compute. 

s 

2.  Ttotaj  should  be  large  in  order  that  the  amount  of  averaging  be 

"T  ' 

sufficient  to  reduce  th 5  bias  and  variance  of  the  spectral  estimates. 

3.  d°  w  (t)  should  be  continuous  for  n  =  0 ,  1 ,  2 ,  ...»  up  to  some 
*  ’  s 

dtn 

reasonable  limit,  since  this  ensures  that  the  sidelobes  of  the  Fourier  transform 


of  w  (t)  die  off  rapidly, 
s 

4.  The  Fourier  transform  of  w  (t)  should  also  be  narrow  in  the  main  lobe 

S 

(narrower  than  the  finest  detail  of  the  true  spectral  density  matrix  of  processes 
x(t)  and  y(t)).  Generally,  this  lobe  is  narrowed  by  increasing  T. 

The  specific  selection  of  a  weighting  function  involves  a  number  of  tradeoffs. 


Therefore,  for  example,  if  T  =  1  and  a  =  £ » then  p^  =  50  percent.  Whereas 
if  T  =  1  and  a  =  3/4,  then  pQ  =  25  percent. 

Note  that  if  a  >  T  there  would  be  no  overlap,  and  each  segment  would  be 
virtually  independent  of  the  previous  one  (except  for  correlated  edge  effects). 

All  theoretical  results  here  are  concerned  with  the  case  of  independent  segments, 


that  is,  no  overlap.  A  detailed  analysis  of  the  effect  of  overlapped  weighted 
segmentation  for  estimating  auto  power  spectral  density  functions  is  given  by 
Nuttall.  ** 


in.  B.  COHERENCE  ESTIMATOR 


Let  x  ,  where  p  =  0,l,2,...,  P-1  denote  the  P-point  sequence 
sp 

obtained  from  the  sth  weighted  segment  of  process  x(t) .  In  estimating  the 
coherence  function,  it  is  necessary  to  evaluate  a  transformation  of  this  weighted 
sequence.  The  FFT  is  a  fast  algorithm  for  evaluating  a  special  case  of  the 
Z-transform  of  a  finite  sequence  of  numbers.  The  two  sided  Z-transform  of 
an  infinite  sequence  is  defined  by 


V2)=E  ’V'p  - 

p  =  -oo 


where  7.  equals  any  complex  variable. 

Evaluation  of  the  Z-transform  at  P  equally  spaced  points  around  the  unit 
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circle  for  a  P-point  sequence  yields  the  P-point  DFT: 
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This  fact  is  so  basic  that  it  is  often  not  discussed.  However,  it  points  out 


that  regardless  of  the  value  of  the  true  magnitude-squared  coherence,  I  y  I  =  1 
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when  n  =  1 .  Consequently,  the  estimate  le,  in  general,  biased;  the  actual  bias 
depends  on  |  y  | 2  and  a  .  In  practice,  n  should  be  large,  as  will  be  shown. 


A 


TV.  STATISTICS  OF  ESTIMATE  OF  COHERENCE 

Goodman,  in  his  Eqs.  (4. 51)  and  4. 60),  *  derived  an  analytical  expression 
for  the  probability  density  function  of  the  magnitude-coherence  estimate,  |  Y*  I  , 
based  on  Eqs.  (3. 11)  and  <3. 12).  His  results  were  based  on  two  zero-mean 
processes  that  were  stationary,  Gaussian,  and  random  and  had  been  segmented 
into  n  independent  observations  (that  is,  nonoverlapped  segments).  Each  seg¬ 
ment  was  assumed  large  enough  to  ensure  adequate  spectral  resolution.  Further, 
each  segment  was  assumed  perfectly  weighted  (windowed),  in  the  sense  that  the 

Fourier  coefficient  at  some  kth  frequency  was  to  have  "leaked"  no  power  from 
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other  bins.  However,  Hannan  points  out  that  the  statistics  do  not  hold  at  the 
zeroth  or  folding  frequencies. 

The  material  in  this  chapter  relating  to  magnitude-squared  coherence  is 
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believed  to  be  new  (Carter  and  Nuttall  )  and  is  a  direct  extension  of  Goodman’s 
work.  *  All  of  Goodman's  original  assumptions  hold.  Statistics  of  the  magnitude- 
coherence  estimator  are  given  in  Appendix  A. 


IV.  A.  PROBABILITY  DENSITY  AND  CUMULATIVE 
DISTRIBUTION  FUNCTIONS 

The  first-order  probability  density  and  cumulative  distribution  functions  for 
the  estimate  of  magnitude- squared  coherence,  given  the  true  value  of  magnitude- 
squared  coherence  and  the  number,  n  ,  of  indeponds*^  segments  processed, 
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are  presented  in  closed-form.  The  expressions  are  evaluated  and  plotted. 

IV.  A.  l.  Probability  Density  Function 

The  conditional  probability  density  function  for  the  estimate  of  magnitude- 

/\i  2  I  |  2  9 

squared  coherence,  7  ,  between  two  processes,  given  j  7  |  and  n  ,  is 

p(l- 12|"- M2)  -fr-4-Mff-Mf* 

nil!  I*  I*  I>'|2)  ,  OS  |t|2|/y|2  <  1  (4.1) 

It  then  follows,  knowing  |  ^  |  «  0*1  2]  ,  that 

n.  I7  |)  =  p(l7i  2|n*l7  I2  )  2I^I  *  <4-2> 

Equation  (4.2)  can  be  shown  (Appendix  A)  to  be  Goodman's  result.*  The 

26 

density  function,  Eq.  (4. 1) ,  can  be  rewritten  using  Eq.  (15.35)  of  Abramowitz 
in  the  following  alternate  forms: 

p(l^|2|  n»l7  |2)=  (n-  l)(l-|7  |2)  (l-l7!2) 

*  (i-M2!?!2)  2F^l-n,l-n;l;j7|2|7'|2)  (4.3) 


and 
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Equations  (4. 3)  and  (4. 4)  are  desirable  because  (1  -  n ,  1  -  n ;  1 ;  1 7 1  |  7  j  J 
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can  be  expressed  as  an  (n  -  l)st  order  polynomial  (Abramowitz,  Eq.  (15. 4.1)  ). 


A  special  case  of  the  density  function  occurs  when  |  7  |  =0.0  .  In  that  event, 
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(4.5) 


IV.  A.  2.  Cumulative  Distribution  Function 

4 

Fisher,  working  on  statistics  of  the  estimate  of  the  squared  correlation 
coefficient,  derived  the  probability  density  for  that  random  variable.  He  inte¬ 
grated  the  result  and  achieved  a  closed-form  solution  for  the  cumulative  distri¬ 
bution  function;  specifically  the  solution  was  a  finite  sum  of  F  functions, 

&  1 

each  one  a  finite-order  polynomial.  Although  these  statistics  are  for  a  cuferent 
problem,  proper  identification  of  variables  yields  exactly  the  integration  for¬ 
mula  needed  to  find  the  cumulative  distribution  of  the  estimate  of  magnitude- 
squared  coherence,  namely, 


~4 
'  '* 


ijft  -W  »*.-f*y**.A* 
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V 


Fig.  5,  Summary  Flow  Chart  for  Density 
and  Cumulative  Distribution 
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cumulative  distribution,  Eq.  (4. 6),  can  be  similarly  evaluated.  The  density 
function  and  the  cumulative-distribution  function  were  computed  as  illustrated 
in  Fig.  5  for  ten  values  of  |  y  P  and  for  n  =  32,  40,  48,  56,  and  64.  (The 
computations  and  100  plots  were  done  on  the  UNIVAC  1108  in  less  than  5  min¬ 
utes.)  Example  plots  are  included  in  Figs.  6  through  13. 

One  example  of  how  these  plots  can  be  used  is  as  follows:  Magnitude- 
squared  coherence,  |  ^  ,  is  estimated  by  averaging  over  32  disjoint  seg¬ 

ments  of  data  (that  is,  n  =  32).  Suppose  the  estimated  value  is  approximately 
0.3,  then  from  Fig.  7 


(4.9b) 


Equation  (4.9b)  could  be  set  equal  to,  for  example,  0.9,  and  the  value  of 
L,  from  Fig.  6,  is  0.2. 

The  upper  limit  is  found  from 


which  could  be  set  equal  to,  for  example,  0.9,  and  the  value  of  A ,  from  Fig.  7, 
is  0.43.  Hence, 


o 


00 


unsnusia  3A?r 


Fig.  13.  Probability  Density  and  Cumulative  Distribution  Functions  for  |4|2  Given  n  =  64  and  j‘y|2  =  0. 9 
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.p!2|„  =  32,H2  =  0.3)d|<M2  = 


=  0,8  . 


(4.11) 


On  the  basis  of  Eq.  (4.11),  the  probability  that  the  estimator  falls  in  the 
range  (0. 2,  0.43)  is  0. 8,  given  that  the  exact  value  of  the  unknown  parameter 
was  0. 3  and  that  32  disjoint  segments  were  used. 

Proper  use  of  the  cumulative  distribution  function  yields  confidence  intervals 
for  the  estimate  of  magnitude-squared  coherence  or  any  ’’one  for  one"  trans- 

/  M2  \ 

formation  of  it,  such  as  the  positive  square  root  or  10  log  I  — rrr  I .  (See, 

10  \  i  ~J y| z  ' 

27  25  » *  I 

for  example,  Cramer  or  Carter  and  Nuttall.  ) 


IV.  B.  mth  MOMENT  OF  DENSITY  FUNCTION 


The  mth  moment  of  the  magnitude- squared  coherence  is  given  by 


[(flTI-  M2)  W-fl*' 

(n- l)(l-|T|2)”/  (l-l^l2)"2  2F1(n,n:l;h|2l^l2) 


^  2m  d  ^  2 


where  use  has  been  made  of  the  density  function,  Eq.  (4.1) . 


Application  of  Eq.  7. 512(12)  by  Gradshteyn  yields 


(4. 12) 
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I7^rL 


(' 


*3F2  *  m  +  1’  n»  n;  m  + 


n,  1;  |r  |2)  .  (4.13) 


The  three-two  hype rgeome trie  functions  denoting  three  numerator  terms 
and  two  denominator  terms  are  given  by 


®  (a)k(b)k(c,k  z 

F  (a,b,c;  d,e;z)=£] - »  (4.14) 

o  Z 


k=0  (d)k(e)k 


,26 


kl 


where  the  (a)^  notation  is  Pochhammer’s  symbol  defined  by 


(a)k  *  • 

r(a) 


(4.15) 


The  mth  moment  for  the  estimate  of  magnitude-coherence  is  given  in  Appendix 
A. 

These  results  can  be  verified  through  proper  identification  of  variables  in 
the  work  of  Anderson,29  who  extended  Fisher's  original  work‘d  on  the  squared 
correlation  coefficient. 

IV.  C.  BIAS  AND  VARIANCE 

|/\|  2 

This  section  deals  with  the  bias  and  variance  of  the  estimator  |  7  ( 

Exact  and  approximate  expressions  are  presented.  In  addition,  computer 
evaluation  of  the  exact  expressions  is  presented  to  lend  meaning  to  these  results. 


/  1 


I 
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IV.  C.  1.  Bias 


Consider  now  the  first  moment  of  the  probability  density  function  for  the 
estimate  of  magnitude-squared  coherence.  This  moment  can  be  written 


l^l  9  1  1  15 

a  (t-W2 

■>" 

/  |  io 

\ 

E  N2|  n,  |  7 1 
- 

r 

J  n 

JL  i 

3 

Fj2,n,n;  n  +  l,l;|7| 

J  .  (4.16) 

If  |7|  =1.0,  gFg  =  oo therefore,  the  evaluation  of  Eq.  (4.16)  is  not 


meaningful.  When  |  7j  =Q.O',  gF  =1.0,  which  yields 


tedious  manipulation  of  Eq.  (4. 16)  (Appendix  B)  yields  the  simpler  result: 


fFi(i.x,«  +  »,|1r|*).  <4.: 


An  extremely  useful  approximation  can  be  made  by  expanding  Eq.  (4. 18)  to 
obtain 


e(l?l2|n.H-W+— H2* 1  ■’  (|v |»V 

'  'nn+1  (n  +  l)(n  +  2)  '  ' 


(n  +  l)(n  +  2)(n  +  3)  '  ' 


(4.19) 


Computation  of  higher  order  approximating  polynomials  is  also  easily  performed 
and  is  based  on  an  analytical  expression  for  E  (PHn.hl2). 


* 

mmmuhm 


The  bias  or  expected  estimation  error  is  defined  as 

12 


Bias^  E  (l^|  2|  n ,  |  Y  i2)  -  | 
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(4.20) 


From  Eq.  (4.18),  an  exact  expression  for  the  bias  is 


Bias 


Sl  +  *Z±  |7|2  F  /lf  ijn  +  2;  hi2)  -  \j\l 
n  n  +  1  ' 


(4.21) 


Expanding  Eq.  (4. 21)  gives  the  approximation 


B  SI.-2_|r|2  +  -iJJa^Jl(|T|2)2  + - tizJUU — LA3, 

°  n  n  +  1  (n  +  l)(n  +  2)  '  ’  (n  +  l)(n  +  2)(n  +  3) '  ' 


Bias  - 


B 


B  >0 
o  ~ 


B.  <  0 
o 


(4.22a) 

(4.22b) 


As  an  example  of  using  this  approximation  for  a  =  8 ,  the  exact  bias  lies  in  the 

range  (0.0,  0.125),  depending  on  |yj2  ;  and  the  maximum  difference  between 

2 


Eqs.  (4. 21)  and  4. 22)  is  0. 0027  at  { 7  j  =0.86.  For  large  n,  Eq.  (4.22a 
and  b)  reduces  to 

Bias  3  -  \  1  -  |7|  I  .  (4. 22c) 


•  M-H')’. 


It  should  be  noted  (see,  for  example,  Eqs.  (4. 22a)  and  (4. 22b))  that 

Urn  (Bias)  =  0  ; 
n — -oo 

therefore,  the  estimator  may  be  referred  to  as  asymptotically  unbiased. 


(4. 23) 
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An  empirically  determined  bias  was  found  by  Benignus*0  to  be 

Bias  =  —  <1  -  j > | 2)  ,  (4.24) 

n 

which  fits  the  true  curve  for  j  7  |  ^  =  0.0  and  1 7^  =  1.0  . 

It  is  suggested  that  the  higher  order  polynomial  expression  for  bias, 

Eq.  (4. 22c),  analytically  derived,  be  used  (as  opposed  to  Benignus*  result, 10 
Eq.  (4. 24)),  especially  for  small  n.  However,  it  can  be  shown  that  Benignus' 
result  is  an  upper  bound  on  the  bias  for  any  n. 

I/S  j  2 

7  j  owing  to  insufficient  sfjectral  resolving 

o 

powerifor  example,  FFT  too  small)  is  given  by  Jenkins  and  Watts,  but  is 
beyond  the  scope  of  this  thesis.  The  formula  for  bias  derived  above  assumes 
sufficient  resolving  power. 

IV.  C.  2.  Variance 

The  variance  of  the  estimator,  namely,  the  second  moment  about  the  mean, 
is  given  by 


Variance  s  V 


4^Y]-  [E(i?i2)]2  • 


(4. 25) 


The  second  moment  of  the  density  function  is,  as  a  consequence  of  Eq.  (4. 16), 

An 


3 Fg  (3,  n,  n;n+2,  1;  |y|2 )  •  (4.26) 


When  7  =0.0,  Eq.  (4. 26)  yields  the  result 


ffiSSSOBSEBSB35BS®®33 


l-'*-  ‘^SSSSSXftSSSSs^aatmi^aa 
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l(i^12)2U.M2=o-o]=  — 

1'  ’  J  n(n  -f  1) 


(4.27) 


An  exact  expression  for  the  variance  of  l^j2  is  obtained  from  Eqs.  (4. 16), 
(4.25),  and  (4.26).  The  result  is 

V=  3f2(3,».nin+2.  l;|>|2) 

n(n  + 1)  '  ' 

.  JLhll  3F2^2,n,n;n  +  l,l;!7|2) 

12 

For  the  special  case  of  7  |  =  0.0  , 


(4. 28) 


V  = 


n(n  + 


1)  W  n  (n  +  1) 


7  =0.0 


(4. 29a) 


and 


,  for  large  n  and  7  2  =0.0  . 


(4.29b) 


In  order  to  avoid  tedious  and  srror-prone  hand  manipulation,  computer- 

30 

aided  formula  manipulation  of  Eq.  (4. 26)  was  used  to  yield  an  approximation 

012 


for  the  variance  of  7  .  The  result  is 


*5 
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[l  +  2"JL!  |Tp  .jJnLnL^U.  (up) 

n(n  +  1)  In  n  +  2  (n  f  l)(n  +  2)(n  +  3)  '  ' 


+  2  n4  -  6n3  -  n2  +  lOn  -8  /L  |2 
(n  +  l)(n  +  2)  (n  +  3)(n  +  4)  ' 


y 


+  13n5-  15n4-  113n3  +  27n2  +  136n-  120  /  !  ,  |2\4 
(n  +  l)(n  +  2)2  (n  +  3)(n  +  4)(n  +  5)  V  ' 


(4.30a) 


or 


V  = 


V  ,  V  >  0 
o  o  — 


0  ,  V  <  0 
o 


(4.30b) 


As  an  example  of  using  the  approximation  given  by  Eq.  (4. 30)  for  n  =  8 , 

the  exact  variance  lies  in  the  range  (0.0 ,  0.031) ,  and  the  maximum  error  due 
/*  I  1 2 

to  the  approximation  in  Eq.  (4.30)  is  0.0067  at  7  j  =0.83,  This  result  is 

g 

a  generalization  of  the  third-order  approximation  by  Jenkins  and  Watts,  which 

\  /\  i  2 

has  no  zeroth  order  term;  that  is,  it  assumes  no  variance  of  j  7  |  when 

I2  =  0.0. 


In  particular,  for  large  n  and  J  y  |  2  *  0  ,  Eq.  (4. 30)  reduces  to 


V  =  —  y  *  2 


(4.  31) 


which  has  a  maximum  value  of  8/27n  at  7  =  1/3 


A«>4*  it 


Hence,  the  variance  of  the  estimator  in  the  case  where  |  7  j 2  is  unknown 
(but  nonzero)  decreases  inversely  proportional  to  n  . 

IV.  C.  3.  Digital  Computer  Evaluation 
Of  Bias  and  Variance 

Practical  experience  in  estimating  the  magnitude- squared  coherence 

function  leads  one  to  anticipate  certain  bias  and  variance  problems.  For  a 

i  1 2 

given  number  of  segments,  n,  when  |  7  =1.0,  neither  a  bias  nor  a 

variance  problem  exists;  however,  when  |  7  | 2  =  0.0 ,  the  average  value 

II  2 

7  j  is  about 

0.3  to  0. 4  ,  the  variance  of  the  estimator  appears  much  greater  than  when 
J  7  | 2  =0.0.  Primarily  because  this  and  the  behavior  of  the  estimator  with 
increasing  n  can  not  be  readily  sensed  from  Eqs.  (4.21)  and  (4.28), 
a  computer  program  has  been  written  to  evaluate  and  plot  those  functions  (see 
Fig.  14).  The  results,  Tables  1  through  5  and  Figs.  15  through  26,  dramat¬ 
ically  portray  the  behavior  of  these  complicated  (but  readily  evaluated)  functions. 
Tables  and  graphs  of  this  type  have  been  prepared  in  the  past  by  Amos  and 
Koopmans.5 
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TABLE  1 


A  ,2 


BIAS  AMD  VARIANCE  OF  M  FOR  n  =  32 


171“ 


E<l7l2) 


Bias 


E[($|2)]: 


Variance 


•UUuOu 
.40000-01 
.00000-01 
, 12U00+00 

.16000 too 
•20UOO+00 
.24000+00 
,28u00+00 
.32000^-00 
.360004-00 
.40000+00 
.44000+00 
,481)00+00 
.52U00+00 
,66000+00 
•60O00+00 
.64000+00 
.68000+00 
•72000+00 
.76000+00 
.60000+00 
•84000+00 
.88000+00 
.92000+00 
.96000+00 
.10000+01 


.31260-01 

.66870-01 

.16658+00 

.1+438+00 

•ld227+00 

.22025+00 

.26832+00 

.29648+00 

,33474+00 

,3/309+00 

.41153+00 

.43007+00 

.46871+00 

.52744+00 

.5o6£7+00 

.60519+00 

.64422+00 

«6o334+00 

•76256+00 

•7old9+00 

,60131+00 

.6+084+00 

.86047+00 

.92021+00 

.96005+00 

.10000+01 


31250-01 

28670-01 

26579-01 

24376-01 

22267-01 

20247-01 

16316-01 

16482-01 

14738-01 

13068-01 

11533-01 

10072-01 

87068-02 

74379-02 

62661-02 

51920-02 

42165-02 

33402-02 

25640-02 

18686-02 

13147-02 

84326-03 

47491-03 

21019-03 

48847-04 

00000 


,18939-02 
,75808-02 
.15823-01 
,26654-01 
, 40110-01 
.56227-01 
.75041-01 
,96590-01 
.12091+00 
,14805+00 
,17803+00 
.21091+00 
,24672+00 
.28551+00 
.32732+00 
,37220+00 
,42018+00 
,47132+00 
,52566+00 
.58326+00 
,64416+00 
,70841+00 
,77606+00 
.84717+00 
,92180+00 
.10000+01 


.91738- 

.28377- 

.44634- 

.56089" 

.68888- 

.77162- 

.83127- 

.86886" 

.88624- 

.88513- 

,86732- 

.83462- 

,78895- 

.73225- 

.66654- 

,59391- 

.51651- 

,43659- 

.35644- 

,27845- 

.20509- 

.13891- 

.82562- 

,38801- 

.10573- 

.00000 


>03 

•02 

•02 

02 

02 

•02 

•02 

•02 

•02 

02 

02 

02 

02 

02 

•02 

>02 

>02 

02 

>02 

>02 

02 

02 

>03 

03 

03 


A. 
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TABLE  2 


BIAS  AND  VARIANCE  OF  |4 1 2  FOR  n  =  40 


in2 

E(|4l2) 

Bias 

E  [( l$|2)]2 

Variance 

00000 

•25000-01 

,25000-01 

.12195-02 

59451-03 

40000-01 

•63085-01 

•23085-01 

,61677-02 

21880-02 

60000-01 

.11)124+00 

,21243-01 

•13784-01 

35342-02 

12000+00 

.13947+00 

•19474-01 

,24099-01 

46458-02 

16000+00 

.17778+00 

•17779-01 

•37141-01 

55357-02 

20000+00 

•21616+00 

,16158-01 

•52942-01 

62174-02 

24000+00 

.23461+00 

•14612-01 

•71532-01 

67043-02 

28000+00 

•29314+00 

•13141-01 

,92942-01 

70106-02 

32000+00 

.35174+00 

•11745-01 

•11721+00 

71507-02 

36U00+00 

,37042+00 

•10425-01 

,14435+00 

71391-02 

40000+00 

.40918+00 

•91808-02 

,17442+00 

69912-02 

44000+00 

•4h801+00 

.*0137-02 

,20744+00 

67223-02 

46000+00 

•46692+00 

•69239-02 

•24344+00 

63484-02 

52000+00 

•52591+00 

•59117-02 

•28247+00 

56859-02 

56000+00 

•58498+00 

•49776-02 

,32455+00 

53515-02 

60000+00 

•60412+00 

•41222-02 

.36973+00 

47625-02 

64000+00 

•6+335+00 

•33459-02 

,41803+00 

41364-02 

68000+00 

•66265+00 

•26490-02 

,46950+00 

34916-02 

72000+00 

,72203+00 

.20322-02 

,52418+00 

28465-02 

76000+00 

,76150+00 

•14960-02 

,56210+00 

22204-02 

80000+00 

,60104+00 

•10408-02 

,64330+00 

16329-02 

64000+00 

,64067+00 

•66704-03 

•70783+00 

11044-02 

aaooo+oo 

•86038+00 

•37527-03 

,77572+00 

65547-03 

92000+00 

,92017+00 

• 16560-03 

,64701+00 

30789-03 

96000+00 

•96004+00 

•37304-04 

,92176+00 

84645-04 

10000+01 

,lw000+01 

Hi 

,10000+01 
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TABLE  3 

BIAS  AND  VARIANCE  OF  |7|2  FOR  n  = 


l7l2 

E(i4l2) 

Bias 

.00000 

,2u833-01 

.20833-01 

.40000-01 

.59231-01 

.19231-01 

.60000-01 

,9/byi-0i 

.17091-01 

•12u00+00 

.13621+00 

•16213-01 

.16000+00 

,17480+00 

.14797-01 

.20000+00 

.2x344+00 

.13443-01 

•24000+00 

.28215+00 

.12153-01 

.26000+00 

,2*093+00 

.10926-01 

•32000+00 

,3^976+00 

.97616-02 

.36000+00 

,3o866+00 

.86615-02 

.40000+00 

.40763+00 

.76254-02 

,44000+00 

.4*665+00 

•66538-02 

.48000+00 

,4ob75+00 

.57469-02 

•52000+00 

.52491+00 

,49051-02 

.  9 

.56000+00 

.56413+00 

.41286-02 

.60000+00 

.60342+00 

.34179-02 

.64000+00 

,6+277+00 

.27732-02 

,66000+00 

,66219+00 

.21948-02 

.72000+00 

, 72168+00 

•16832-02 

.76000+00 

.76124+00 

•12386-02 

,80000*00 

,80086+00 

.86127-03 

.64000+00 

.84055+00 

.55169-03 

,88000+00 

.86031+00 

•31001-03 

.92000+00 

.92014+00 

.13637-03 

•96000+00 

,9o003+00 

.29739-04 

•10000+01 

,10000+01 

.00000 

A|2n2 


E[<tfh] 
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Variance 


.41631-03 

,17769-02 

.29247-02 

.36710-02 

.46273-02 

,52054-02 

.56175-02 

.58758-02 

,59929-02 

.59816-02 

,58552-02 

.56269-02 

,53106-02 

,49201-02 

.44700-02 

,39746-02 

,34491-02 

,29088-02 

.23691-02 

.16462-02 

.13565-02 

,91652-03 

,54359-03 

,25541-03 

.71213-04 

,00000 


TABLE  4 


BIAS  AND  VARIANCE  OF  |7|2  FOR  n  =  56 


•00000 

.40000-01 

•80000-01 

•12000+00 

.16000+00 

•2QUO0+00 

•24000+00 

•28000+00 

•32000+00 

•36000+00 

•40000+00 

•44000+00 

•48000+00 

•52000+00 

•56000+00 

•60000+00 

•64000+00 

•68000+00 

•72000+00 

•76000+00 

,80000+00 

•84000+00 

•88000+00 

•92000+00 

•96000+00 

•10000+01 


E(i4i2) 

.1/857-01 

•5o460-0l 

.96157-01 

•16369+00 

•1/267+00 

.21151+00 

•250<*0+00 

•26935+00 

•32835+00 

•3o741+00 

•40652+00 

•4+569+00 

•46491+00 

.52419+00 

•5o353+00 

•60292+00 

•64237+00 

•66167+00 

•72144+00 

•7ol06+00 

•80073+00 

•84047+00 

•66026+00 

•92012+00 

•96002+00 

,10000+01 


.17857-01 

•16480-01 

•15157-01 

•13887-01 

.12671-01 

•115S0-C1 

•10402-01 

•93494-02 

•83514-02 

,74084-02 

•65206-02 

•56884-02 

•49119-02 

.41913-02 

•35270-02 

•29190-02 

•23678-02 

•18735-02 

•14364-02 

•10566-02 

•73449-03 

•47025-03 

•26403-03 

•11579-03 

•24268-04 

•00000 


E[(l$l2)]2 

,62657-03 

,46844-02 

•11549-01 

,21244-01 

•33791-01 

,49213-01 

.67535-01 

,88780-01 

.11297+00 

•14014+00 

•17030+00 

,20348+00 

.23970+00 

,27900+00 

•32140+00 

,36692+00 

,41559+00 

,46744+00 

,52250+00 

,58079+00 

,64234+00 

•70717+00 

,77533+00 

•64683+00 

•92171+00 

,10000+01 


Variance 

•3076^-03 

,14944-02 

,24943-02 

.33178-02 

•39751-02 

.44769-02 

•48339-02 

•50571-02 

,51577-02 

,51470-02 

.50366-02 

,48583-02 

,45642-02 

,42264-02 

,38376-02 

,34103-02 

•29575-02 

•24926-02 

,20288-02 

,15800-02 

•11601-02 

•78333-03 

,46438-03 

.21832-03 

*61738-04 

•00000 


53 


TABLE  5 


BIAS  AND  VARIANCE  OF  1 7  I2  FOR  n  =  64 


.0QU00 

.40000-01 

.aouuu-oi 

.12U00+00 
,16000+00 
.20000+00 
.24000+00 
•26000+00 
.32000+00 
•  3t>UUp+00 
•40000+00 
.44000+00 
•46000+00 
•52000+00 
•56O00+00 
.60000+00 
.64000+00 
•68000+00 
•72000+00 
,76000+00 
•60000+00 
•64000+00 
•68000+00 
•92000+00 
•96000+00 
.l0U00+0i 


E(j$l2) 

•1 j62b-01 
, 5*416-01 
•9o2b8-01 
•16214+00 
,1/106+00 
.2x006+00 
.2+909+00 
•2o8l7+00 
. 3*730+00 
•3ob47+00 
•4u»70+00 
,4+497+00 
•46429+00 
•523o6+00 
•55306+00 
•6U255+00 
,6+207+00 
•6ol63+00 
,7«jl25+00 
,7o092+00 
•60064+00 
•8+041+00 
,86023+00 
•92010+00 
•9o002+00 
•luOOO+Ol 


Bias  E[(|'y|2)]2  I  Variance 


15625-01 

14418-01 

13258-01 

12145-01 

11080-01 

10062-01 

90924-02 

81707-02 

72971-02 

64720-02 

56955-02 

49676-02 

42687-02 

36589-02 

30763-02 

25473-02 

20656-02 

16342-02 

12527-02 

92125-03 

64025-03 

40970-03 

22980-03 

10040-03 

20237-04 

00000 


•48077-03 

.42499-02 

.10871-01 

.20365-01 

.32752-01 

.46053-01 

.66289-01 

,67481-01 

,11165+00 

..13882+00 

.16901+00 

.20224+00 

,23854+00 

.27792+00 

.32042+00 

,36605+00 

.41484+00 

,46681+00 

,52198+00 

,58038+00 

,64204+00 

.70697+00 

,77521+00 

,64678+00 

.92169+00 

•10000+01 


.23663-03 

•12866-02 

,21743-02 

,29030-02 

•34842-02 

•39273-02 

.42422-02 

.44386-02 

•45267-02 

.45166-02 

.44187-02 

.42434-02 

,40016-02 

.37041-02 

,33619-02 

.29862-02 

,25886-02 

,21806-02 

.17740-02 

,13809-02 

,10133-02 

.68399-03 

,40543-03 

.19082-03 

,54669-04 

,00000 
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V.  EXPERIMENTAL  INVESTIGATION 
OF  OVERLAP  EFFECTS 


An  experiment  has  been  conducted  to  study  the  effect  of  overlap  of  data  on 

the  estimate  |  2.  The  analytical  results  presented  earlier  relate  only  to  the 

case  of  independent  segments  (that  is,  the  case  of  zero  percent  overlap).  This 

experiment  examines  the  effect  of  different  amounts  of  overlap  on  bias  and 
I  /\\2 

variance  of  |  7  | 

Intuitively,  it  seems  that  the  application  of  nonoverlapping  weighting  func- 

A  1 2 

tion  does  not  make  the  best  use  of  the  data  when  forming  the  estimator  y  (f^)  | 

This  inefficiency  is  similar  to  the  wastage  in  forming  auto  power  spectral 
density  functions  shown  by  Nuttall.  **  When 
lap,  larger  bias  and  larger  variance  result  than  when  |  y  (ffc)  j  is  formed  from 
the  same  data  with  overlap.  Because  this  inefficiency  can  not  be  permitted  in 
many  practical  situations  of  interest  (for  example,  underwater  acoustic  environ¬ 
ments),  it  is  desirable  to  know  how  much  the  bias  and  variance  can  be  reduced 
and  at  what  expense  this  reduction  can  be  achieved. 

V.  A.  METHOD 

The  method  of  achieving  the  desired  objective  is  straightforward  in  concept. 

Data  are  generated  with  an  accurately  prespecified  value  of  magnitude- squared 
2 

coherence,  y  ,  which  is  independent  of  frequency,  f  .  Since  the  data 


is  formed  without  over- 
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have  been  generated  so  that  the  magnitude-squared  coherence  is  independent  of 
frequency,  the  sample  mean  and  variance  of  j  y')2  can  be  empirically  deter¬ 
mined  for  the  given  overlap  by  averaging  over  frequency.  These  data  can  then 
be  reprocessed  at  several  different  overlaps  to  form  estimates  of  bias  and 
variance. 

V.  A .  1 .  Data  Generation 

Consider  the  zero-mean,  wide-sense  stationary,  Gaussian  waveforms 
n  (t)  and  n  (t)  that  are  statistically  independent  and  have  power  spectral 

X  M 

density  functions  $  (i)  and  $  (f) ,  respectively.  Statistical  independence 

11  2  2 

dictates  that  they  be  uncorrelated;  that  is, 

Rn  n  (r)  =  E  [”l(t)  n2(t  +  T)]  =  0  *  <5-  V 

In  order  to  generate  two  processes  with  magnitude-squared  coherence 

31 

independent  of  frequency,  let  (Nuttall  and  Carter  ) 


(5.4) 
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Expanding,  dropping  terms  that  go  to  zero,  and  taking  the  Fourier  transform  of 
Eq.  (5.4)  yields 


$  (f)  =  Ckl>  (f )+C*  n  (f)  . 

xy  nini  n2n2 


(5.5) 


The  autocorrelation  of  x(t)  is 


Rxj£(r)  =  E  |[n1(t)  +  Gn2(t)j  [n^t  +  r)  +  (hyt  +  r  )J  J  . 


(5.6) 


Expanding,  dropping  terms  that  go  to  zero,  and  taking  the  Fourier  transform  of 
Eq.  (5.6)  yields 


<Z>(V  =  <t>  _  (f)  +  G  *  (f) 

xx  n1n1  V2 


(5.7) 


Similarly, 


<J>  (f)  =  cj>  (f)  +  G  <J>  (f)  . 

vyy  “2n2  nini 


(5.8) 


Thus,  the  magnitude- squared  coherence  between  x(t)  and  y(t)  is 


y  (f)  = 

xy' 


GV,+GV) 


*  (f)  +  G2 *  (f)l  (f)  +  G2<J>  (f) 

-  nini  .  n2n2J  L  n2°2  nini- 


(5.9) 


Now  introducing  the  assumption  that  <f>  (f)  =  <f>  (f)  =  (J>  (f)  , 

nl“l  n2n2  "" 
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>  (f)|2  = 

xv  I 


4G2  <J)2  (f) 
nn' 


(i  ♦  of  *>  (» *  o2)2 


(5. 10) 


which  is  independent  of  frequency. 

In  order  to  prespecify  a  desired  magnitude- squared  coherence,  j  y  | 2  , 
between  x(t)  and  y(t)  ,  the  gain  G  of  Eqs.  (5.2)  and  (5.3)  must  be  selected 


by  solving  Eq.  (5.10): 


->/h 


°<  rdj<  i 


(5.11) 


Yd  =0.0 


I  ' 


Under  the  assumptions  made,  a  prespecified  desired  value  for  magnitude- 

squared  coherence  can  be  generated.  Because  the  generated  processes  will 

later  be  used  to  empirically  determine  a  very  snv  11  quantity  (bias), it  is 

important  that  the  generated  value  of  coherence  is  indeed  the  desired  value.  In  the 

actual  generation  of  two  processes,  the  assumption  <X>q  q  (f)  =  ^  (f)  may  be 

11  2  2 

violated;  therefore  it  becomes  important  to  determine  how  sensitive  Eq.  (5.10)  is 
to  this  assumption.  Consider  then 


Vi(f) 


=  1  +  A  (f)  =  1  . 


(5. 12) 


c«gwwwi>g»»w 


It  is  easily  shown  by  substituting  Eq.(5. 12)  into  Eq.  (5. 9)  that  the  value 

i  2 

of  magnitude-squared  coherence  generated,  |  >g  ,  is 


y  2_  I  2  l«t>U 

6  "  n-**^2  •,  I2  ' 


(5. 13) 


where  1 7,  j2  =  desired  value  of  ]  7 12  ,  and  the  dependence  of  f  is  dropped 


for  convenience. 


The  error  in  the  generated  value  is 


2  12  2 
7g  “  7dl  '  7d 


("Nl-r: 


i  a2 


A  +  4  |  -yd  j2  A  2 


.  (5. 14) 


Evaluation  of  Eq.  (5, 14)  to  third  order  in  A  yields 


2  2  i  2 

\  -  yi  =  Td 


f1-  bdl2)1 


i  A  0-4! 


(5. 15) 


This  quantity  is  maximum  at  pj  =2  and,  hence,  the  maximum  error  is 

I  1 

approximately 


Max  error 


sH 


(i-A)  . 


(5. 16) 


Therefore,  for  example,  when  A  =  0.01,  the  maximum  error  is  approx- 

_g 

imately  6  x  10  ;  for  A  =  0.05  ,  the  maximum  error  is  approximately 

-4  I  2  31 

1.5  x  10  .  (A  table  of  errors  versus  7^  and  A,  as  computed  from 


Eq.  (5. 14),  -ields  results  similar  to  the  given  approximations.) 


The  processes  generated  according  to  Eqs.  (5.2)  and  (5.3)  have  been 
shown  to  be  relatively  insensitive  to  minor  differences  in  the  power  spectral 
density  functions  of  the  original  uncorrelated  waveforms  n^t)  and  n2(t)  . 

The  procedure  for  generating  variable-coherence  time  series  can  briefly 
be  summarized  as  follows:  One  Gaussian  noise  source  uncorrelated  from 
point  to  point  was  used  to  generate  a  thne-limited  sample  function  of  n^(t)  and, 
later,  of  n  (t)  .  (This  method  eliminates  the  need  for  two  identical  filters.) 

The  waveforms  were  band-limited  using  a  low-pass  filter  and  digitized.  The 
digital  data  were  then  stored  on  magnetic  tape  in  a  format  compatible  with  over¬ 
lapped  processing.  Digital  versions  of  x(t)  =  n^t)  +  Gn2(t)  and  y(t)  =  n2(t) 

+  Gn  (t)  were  generated  from  digital  versions  of  n  (t)  and  n  (t)  for  two 
1  1  " 

values  of  j  y  | 2  .  (Investigation  for  other  values  of  true  magnitude-squared 
coherence  appeared  to  be  unnecessary. ) 

A  Hewlett  Packard  Noise  Generator,  Model  No.  HP3722A,  was  used  for 
data  generation  with  the  following  settings: 

Sequence  Length:  Infinite 

Bandwidth:  5  kHz 

Gaussian  rms:  0.6  x  3. 16  volts  (open  circuit). 

The  output  power  density  function  is  flat  to  within  +  0.3  dB,  provided  the 

input  power  voltage  fluctuates  no  more  than  _+  10  percent.  This  corresponds 

-2 

to  a  A  ,  in  Eq.  (5. 12),  of  7. 152  x  10  and  a  maximum  error  in  the  generated 

-4 

magnitude-squared  coherence,  from  Eq.  (5.16),  of  3  x  10  .  For  example,  if 

2  2 
“y ^  =  0.5000  one  could  expect  0.5004  >  >  0.4996,  making  it 
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impossible  to  measure  extremely  small  bias. 

The  data  were  low- pass  filtered  through  a  two-section  Krohn-hite  filter, 
which  was  6  dB  down  at  1200  Hz  and  rolling  off  at  96  dB  per  octave.  These 
band-limited  data  were  A/D  converted  with  a  Control  Data  Corporation  (CDC) 
15-bit  converter.  Sampling  was  done  at  fg  =  4096  Hz. 

V.A.2.  Analysis  Program 


The  FORTRAN  program  (coded  by  G.  C.  Carter,  C.  R.  Arnold,  and 
J.  F.  Ferrie,  of  NUSC)(1)  implements  Eq.  (3. 11);  (2)  generates  data  with  known 
coherence  from  two  incoherent  sources  according  to  Section  V.  A.  1;  and  (3)  com¬ 
putes  the  sample  mean,  bias,  and  variance  of  the  estimator,  as  described  below. 

A  summary  flowchart  of  the  program  is  presented  in  Fig.  27.  The  cosine 

32 

weighting  function  was  coded  by  A.  H.  Nuttall,  of  NUSC.  Singleton  >  oded 
the  mixed  radix  FFT.  The  FFT  size  used  was  4096  data  points  (lsec),  which 
yields  2048  positive  frequencies  and  direct  current.  Frequencies  beyond  1000  Hz 
were  discounted  in  making  estimates  of  bias  and  variance  to  protect  against 
(1)  unknown  noise  in  the  digitizing  system  and  (2)  difference  in  the  two  auto 
power  spectral  density  functions. 

Estimates  of  the  bias  are  performed  according  to 


/s 

Bias  = 


1 


1000 

E 

1000  k=l 


(5.17) 


and  estimates  of  the  sample  variance  according  to 


KAO  INPUT 
PARAMETERS 


Fig.  27.  Summary  Flow  Chart  for  Thesis 
Version  of  FFT  Spectral  Density 
Estimation  Program 
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A  t  f  I  -A  I  2  ^  1  2 

Var  =  -  53  >  (ffc)  -  Bias  I  .  (5.18) 

999  k=l  J 

Results  of  the  experiment  are  described  in  the  next  section. 

V.  B.  RESULTS 

Results  of  the  experiment  for  a  joint  set  of  data,  each  (32  x  4096)  sampleb 

long,  are  included  in  tabular  and  graphic  form.  Confidence  bands  for  the 

estimates  of  bias  can  lie  determined  from  the  estimates  of  variance.  However, 

it  must  be  realized  that  1000  samples  (frequencies)  were  used  to  determine  the 

average,  and  that  each  sample  is  correlated  to  the  extent  of  approximately  0.5  with 

neighboring  estimates  (empirical  results).  This  agrees  with  analytical  results 

14 

provided  for  auto  spectral  estimates. 

It  is  apparent  from  the  results  (Tables  6  and  7  and  Figs.  28  through  31) 

l  /"M  2 

that  the  bias  and  variance  of  |  >  |  can  be  reduced  through  the  use  of  over- 

II  2 

y  |  =o.0,  the  variance  of  the 

estimator  with  50-perce  ♦.  overlap  equals  31  percent  of  the  variance  of  the 

estimator  with  0-percent  overlap.  With  50-percent  overlap,  the  bias  is  55  per- 

I  1 2 

cent  as  large  as  with 0-percent  overlap.  Similarly,  when  |*y  |  =0.3  ,  the 
variance  is  55  percent  of  the  0-percent  overlap  estimator,  and  the  bias  is 
50  percent  as  large.  It  also  can  be  'jeen  from  the  results  that  62.5-percent 
overlap  is  similar  to  having  processed  twice  as  much  data  with  0-percent  over- 
lap.  There  is  one  possible  exception:  The  bias  for  |>  |  =  0.3  is  36  percent 

as  large  as  the  0-percent  overlap  estimator.  This  is  better  than  50  percent, 
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TABLE  6 

EMPIRICAL  RESULTS  FOR  M  2  =  0. 0  AND  n  =  32 


Percent 

Overlap 

No.  FFTs 

Bias 

Variance 

0.0 

32 

.  3156  x  10"1 

.  9463  x  10“3 

12.5 

36 

.  2834  x  10"1 

.7130xl0-3 

25.0 

42 

.  2370  x  10”1 

.5197  x  10“3 

37.5 

50 

.  2043  x  10_1 

.  4069  x  10“3 

50.0 

63 

.  1749  x  10-1 

.  2929  x  10*3 

62.5 

83 

.1582  x  10“ 1 

.  2480  x  10-3 

75.0 

125 

.  1571  x  10-1 

.2463  x  IQ"3 

TABLE  7 

EMPIRICAL  RESULTS  FOR  M2  =  0.3  AND  n  =  32 


Percent 

Overlap 

No.  FFTs 

Bias 

Variance 

0.0 

32 

1.442  x  10“2 

.  8007  x  lO”2 

12.5 

36 

1. 093  x  10"2 

.  7770  x  lO-2 

25.0 

42 

. 959  x  lO-2 

.  5965  x  lO-2 

37.5 

50 

.  71?  x  10“2 

.  5067  x  lO"2 

50.0 

63 

.  597  x  10“2 

.4441  x  lO-2 

62.5 

83 

| 

.  515  x  10"2 

.4063  x  lO"2 

75.0 

125 

.  494  x  lO"2 

.4020  x  10-2 

which  would  be  expected  from  twice  as  much  data. 

Quite  naturally,  there  is  an  increase  in  computational  cost  associated  with 
overlapped  processing.  Specifically,  the  number  of  FFTs  to  be  performed 
(a  measure  of  the  computational  cost)  increases  with  the  percent  overlap  spec¬ 
ified  ( Fig.  32).  The  number  of  FFTs  required  for  50-percent  overlap  is  ap¬ 
proximately  twice  the  number  for  0-percent  overlap. 

Increasing  the  overlap  from  50  percent  to  62. 5  percent,  requires  32-per- 
cent  more  FFTs,  but  the  variance  of  the  estimator  becomes  only  80-95  percent 
of  its  value  at  50-percent  overlap.  In  most  cases,  the  improvement  to  be 
derived  from  using  62. 5-percent  overlap,  as  opposed  to  50-percent  overlap, 
will  not  warrant  the  increased  computational  costs,  and  should  be  used  only 
when  stringent  variance  and  bias  reduction  requirements  are  demanded. 


VI. 


CONCLUSIONS 


A  detailed  analytical  analysis  of  the  statistics  for  estimating  the  magnitude- 
squared  coherence  function  (spectrum)  has  been  made.  When  such  estimates 
are  made,  time-limited  sample  functions  of  long  duration,  which  are  stationary 
(in  the  wide  sense)  over  the  period  of  observation,  must  be  available.  Expres¬ 
sions  for  the  probability  density,  the  cumulative  distribution,  and  the  bias  and 
variance  of  |  7  |  have  been  presented  for  the  case  where  no  overlap  process¬ 
ing  is  used.  Evaluation  of  these  expressions,  which  are  dependent  on  both  the 
true  value  of  coherence  and  the  number  of  observed  segments,  n,  dramatically 
portrays  the  requirement  that  n  be  large. 

The  application  of  a  cosine-weighting  function  in  order  to  reduce  errors  due 
to  sidelobe  leakage  wastes  the  available  data.  As  shown  empirically,  proper 
use  of  the  data  in  terms  of  reduced  bias  and  variance  of  the  estimator  can  be 
achieved  through  overlapped  processing.  It  appears  that  a  62. 5-percent  over¬ 
lap  is  roughly  equivalent  to  having  twice  as  much  data  available.  The  reduced 
bias  and  variance  of  the  estimator  achieved  through  62.5-percent  overlapped 
processing  can  be  realized  almost  entirely  through  a  50-percent  overlap.  The 
computational  cost  associated  with  50-percent  overlap  is  not  unreasonable. 

With  50-percent  overlap,  variance  and  bias  reductions  are  achieved  that  are 
similar  to  reductions  resulting  from  processing  twice  as  much  data  with  0-percent 
overlap.  This  significant  gain  to  be  obtained  from  50-percent  overlap  processing 


should  not  be  overlooked  in  estimating  the  magnitude- squared  coherence  function 
(spectrum)  when  cosine  (Hanning)  weighting  is  used  and  data  are  limited. 
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APPENDIX  A 


STATISTICS  OF 

MAGNITUDE- COHERENCE  ESTIMATOR 


Goodman,  in  his  Eqs.  (4. 51)  and  (4, 60X  derived  an  analytical  expression 
for  the  probability  density  function  of  the  coherence  estimate,  J  'y  J  . 1  His 
results  are  based  on  two  zero-mean  processes  that  are  stationary,  Gaussian, 
and  random  and  are  segmented  into  n  independent  observations  (that  is, 
n  nonoverlapped  segments).  Each  segment  is  assumed  perfectly  windowed, 
as  defined  in  Chapter  IV.  The  probability  that  the  estimate  of 


coherence  would  take  on  some  value,  |  y  j ,  conditioned  on  the  true  coherence 


being  equal  to  )  y  and  upon  n  independent  observations,  was  given  by 


Goodman  as 


p(|?||!,|,„)  J£!ki!3  (1.|^|*j 

x  7  T  (n)  r  (n  -  1)  '  7 


®  r  (n  +  k) 
k=o  r2(k  +  i) 


(M2I?|2)  . 


(A.l) 


where  T(n)  is  the  Gamma  function,  namely, 
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—x  n—  1 

e  x  ~  dx  and  T(n)  =  (n-1)  r(n-l)  . 

u 

24 

This  probability  density  function  is  also  given  by  Hannan.  However,  this 

form  of  the  density  function  is  rather  cumbersome.  A  work  by  Enochson  and 
9 

Goodman  suggests  that  it  may  be  written  in  terms  of  the  hypergeometric  function. 

26 


It  is  first  necessary  to  observe,  from  Abramowitz  and  Stegun,  that 

/  i  00  r2<»  +  k>(M2l^l2) 

F  (n,n;  1;  r|2|  v|2)  -j-  £  -  ' - .  <A.2> 

i\  '  r^n>k=n 


r  (n)  k=0  r(k+l)  kl 


Further,  when  k  is  an  integer, 


k  !  =  T(k  +  1)  . 


(A.  3) 


26 

(See,  for  example,  Abramowitz  and  Stegun.  )  Thus,  Eq.  (A.  1)  is 


p(|  n|Kl,  n)^2|^i(i-M2)  fi -It  I2)  — — 

'  /  '  •  '  '  T(n\  rtv 


T(n)  T(n) 


co  r 


J(n  +  k)  (ItI^I2)1 


0  r(k+  l) 


k  1 


or,  more  simply,  substituting  Eq.  (A.2)  into  (A. 4)  , 


.(|^|||v|,n)  =  2|^|(l-|T!2)  H*'!2)  ( 


(n-  1) 


(A. 4) 


•2Fi(n*  n;i;l>i2l^l2)  * 


(A.  5) 
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Using  Eqs.  (A.  7)  and  (A.  8)  yieiCe 


V  M2a(B1  +  i-.)2  =  B^2B1|r  |*| 


(A.  10) 


Thus, 


B  >  ,'  %  y  +  B,  . 
2  ~  j.  i  1 


(A.  11) 


For  example,  conside'  the  case  j  7  j  -  1.0  .  Now  B^  =  0.0  ,  =  0.0  , 

and  Eq.  (A.  11)  hold  with  -quality.  Consider  also  |y  I  =  0.0  .  Then 


■(Pl|-.M-°.«)- 


r(n)  ro/2) 


T(n  +  1/2) 


(A.  12) 
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Using  Eq.  {6. 1.47)  of  Abramowitz  and  Stegun,  Eq.  (A.  12)  yields  for  large  n 

r  (3/2) 


■  (Ml*- 1>  1-0.°)“ 


\R 


=  h  ^*/n  , 


(A.  13) 


For  y  2  =  0  ,  Eq.  (4.21)  gives 


V1  ’  I  7|2  =  o.o. 

n 


(A.  14) 


Thus,  th£  inequality  holds  and  Eq.  (A.  11)  becomes 


I  >  JL 


(!) 


*>  =0.0  . 


(A.  15) 
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APPENDIX  B 


J 

I 


i 

A 


DERIVATION  OF  A  SIMPLIFIED  EXPRESSION 
FOR  THE  EXPECTATION  OF  THE 
ESTIMATE  OF  MAGNITUDE- 
SQUARED  COHERENCE 


The  major  steps  in  deriving  a  simplified  expression  for  E^j 2  J  n ,  j  7  | 
are  presented  here. 

According  to  Eq.  (4.26), 


/|/N|2l  i  |2\ 
e(|7  i  j n,  7 1  )= 


(t-ny 


n 


F2(^2,  n,n;  n+  1,  1;K  |2)  .  (B.  1) 


which  can  be  manipulated  into  the  form 


(w’kMyi-M2)"  2  — +1)  M 

'  '  '  k  =  0  (n  + 


J  2k 


k)  k  I 


(B.  2) 


Adding  and  subtracting  n  from  the  numerator  term  in  Eq.  (B.  2)  yields 


CD 

£ 

k=0 


^k 
k  ! 


(1  -  n)(n)k 
(k  +  n)  k  ! 


j 


Recognizing  that 


(B.3) 


i  (n)k 
k  +  n  n(n  +  1)^ 


(B.  4) 
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it  follows  that 


«{*l2|..Mt)-(i-Mt)‘ 


1-17!2 


00  (n)k(b)k  Ul 2  <i-n) 

n 


k=0  (b), 


k  ! 


g  (n)k  <n)k 


,!2k 


k=0  (n  +  1). 


k  I 


(B.S) 


In  terms  of  F  functions,  Eq.  (B.  5)  becomes 


!0 


El  Jt'|2  In,  ?|2 


)=(l-|Y|2)”f2F>(n'b!b;|1,12) 


(1-n) 


—  2fi  (e»  n;  n  +  i; 1  y  j 


(B.  6) 


2fi 

By  using  Eq.  (15. 1. 8)  of  Abramowitz  and  Stegun,  Eq.  (B.6)  reduces  to 


pi2|n.M2)=(l-M2)"  [(i-M2)'” 

+  -  8F1^ntn;n  +  Is  |  y  |2)j. 


(B.7) 


Simplifying  and  applying  Eq.  (15. 3. 3)  of  Abramowitz  and  Stegun, 26  Eq.  (B.  7) 
can  be  further  reduced  to 


1 2  J  I  I 

A  <1-n>/  . 

1  |n,M 

r1+  l1-7 

'  n  '  1 

(B.  8) 
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26 

Finally,  by  applying  Eq.  (15.2.6)  of  Abramowitz  and  Stegun,  with  a  =  1, 
b  =  1 ,  and  c  =  n  +  1  ,  Eq.  (B.  8)  can  be  manipulated  into  the  form 


1  +  111 


( 


1 ,  1 ;  n  +  2 ; 


!V  (B.9) 
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